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Universality in Percolation of arbitrary Uncorrelated Nested Subgraphs 
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The study of percolation in so-called nested subgraphs implies a generalization of the concept of 
percolation since the results are not linked to specific graph process. Here the behavior of such graphs 
at criticallity is studied for the case where the nesting operation is performed in an uncorrelated 
way. Specifically, I provide an analyitic derivation for the percolation inequality showing that the 
cluster size distribution under a generalized process of uncorrelated nesting at criticality follows a 
power law with universal exponent 7 = 3/2. The relevance of the result comes from the wide variety 
of processes responsible for the emergence of the giant component that fall within the category of 
nesting operations, whose outcome is a family of nested subgraphs. 



I. INTRODUCTION 



Random graph theory is based on an ensemble formal- 
ism [l|, close to statistical mechanics [1]. Under this 
framework, some purely mathematical phenomena, like 
the emergence of the giant connected component (here- 
after, GCC) Q can be understood as a phase transition 
in the sense of, for example, the transition from ferro- 
magnetic to paramagnetic phase in the Ising model. In 
Landau's theoretical hallmark to study phase transitions, 
one of the main features of a system at criticality is that 
some thermodynamical magnitude m (the order param- 
eter of the system) displays a singularity in one of its 
derivatives Under mild assumptions, one can show 
that m approaches to the singularity as a power law of a 
control parameter e, i.e., m(e) cx . Following the statis- 
tical mechanics analogy, in this note I study the critical 
exponents related to the component size distribution at 
the critical point when the GCC of an arbitrary family 
of nested subgraphs emerges. 

In a previous paper JS], we defined and characterized 
the so-called nested subgraphs, which are a collection 
of families of subgraphs of a given graph whose mem- 
bers can be ordered by inclusion. We assume that these 
subgraphs are obtained through an arbitrary algorithm 
whose outcome holds some probabilistic requirements. 
The main achievement of the developed formalism was 
that the results were not linked to a specific subgraph, 
but they were general to all subgraphs satisfying a small 
set of probabilistic constraints. Among others, we iden- 
tified as nested subgraphs the families of K-coies [l], 
0, the X-scaffolds [ill o'" the subgraphs obtained 
through random deletion of nodes p^ . It is worth 
noting that the study of critical thresholds for the emer- 
gence of the GCC of an arbitrary family of nested sub- 
graphs implies a generalization of the concept of perco- 
lation based in a threshold in the probability of deletion 
of randomly chosen nodes. Furthermore, we showed that 
the degree distribution of a scale-free network with expo- 
nent higher than 2 was invariant under nesting operations 
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i. 

In this work I study the emergence of cluster sizes at 
criticality when an uncorrelated nesting algorithm is ap- 
plied, which implies that the probabilty of removal or 
survival of a given node can be expressed as a function 
of its connectivity and, in the extreme cases, of both its 
connectivity and a mean field approach of the connectiv- 
ity of its first neighbors. We observe that the possible 
long range dependencies conditioning the emergence of 
the K-cove automatically rules out such a subgraph from 
our study, even a probabilistic interpretation of the prob- 
ability of removal can be defined [y] . 

The behavior of cluster sizes at criticality is studied 
following a methodology based on generating functions 
7]. Generating functions to study the emergence of the 
GCC were introduced for the first time in 113. How- 
ever, in this work I use the proposal made in |l2|.|T3|.jT6|. 
also based on the generating function formalism, which 
revealed specially suitable to study network phenomena 
from the physical point of view. In this framework, the 
study of the emergence of the GCC resembles the study 
of phase transitions under Landau's theoretical frame- 
work Q. With this mathematical apparatus I show that 
the probability distribution for the size of components at 
criticality follows a power-law with universal exponent 
7 = 3/2, no matter the kind of subgraph is emerging. 
Previous work derived this exponent for ordinary perco- 
lation jl3] and for the emergence of the giant if -core [13] ■ 
The relevance of this result comes from the wide variety 
of processes that lead to the emergence (or disappear- 
ance) of the GCC which can be included in the category 
of nesting operations. 



II. UNCORRELATED NESTED SUBGRAPHS 

Formally, a complex network is topologically described 
by a graph G{V, T) where V is the set of nodes and T g 
V xV the set of edges connecting nodes oiV . If pk is the 
probability that a randomly chosen node e is connected 
to k other nodes (noted d{e) = k), then the collection 
of pfc's defines a sequence of real numbers {pk}kLi (the 
so-called degree distribution) whose generating functions 
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are 



0: 



9o{z) = ^Pkz''; gi{z) 



where 



Id 



Let us define the generating functions for an arbitrary 
i^T-nested subgraph with an associated nesting function 
ifiKik) defined on a graph Q with arbitrary degree dis- 
tribution {pfcj^i- To be precise, we are talking about 
the generating functions associated with the sequence 
{'PKik)pk}'kLi of real numbers: 



dz 



90 



{z)\z=i ^^kpk 



h{z) = ^pk<~PK{k)2 



is the average connectivity of Q. We assume that our 
{pk}kLi is, at least, 1-smooth, i.e., that (fc) < oo [isj . 

We will say that S{A, Ta) is an induced subgraph [l^ 
of Q{V,T) if A C 1/ and C r being T a A x A. 
A if-nested family of subgraphs N [(y\ is a collection of 
subgraphs of a give n g raph Q whose members can be 
ordered by inclusion |21|: 

■ .■Sk+i{Q) ^ Sk{Q) Q Sk-i{Q)- 

Let the graph Sk — Sk{Vsk ■• ^Sk) be a member of a 
nested family of subgraphs of a given graph Q. For ev- 
ery family of i^-nested subgraphs we associate a nesting 
function, (pK{k), namely the probability for a randomly 
chosen node e G V with degree d{e) = k to belong to Sk- 

ipK{k)^PieeVsJd{e)^k) 

Since tpxik) is a probability, we can express it like a 
function, 

>fK{k):UxN^ [0,1], 

where J7 C R is a set that depends on the nature of 
the nesting. We need our nesting functions to fulfill the 
following conditions: 

1. fixed K, ipxik) is a non-decreasing function on k, 

2. fixed k, Lp^ik) is a non-increasing function on K 
and, 

3. (V(^k)[(3As,, e (0,l])|(limfe^„o^K(fc)) = XskI 
where Ag^ is a scalar whose value will depend on 
the explicit form of the nesting algorithm. 

To be consistent with the characterization, we make ex- 
plicit an underlying assumption: Let 5^,5'^ be a pair 
of subgraphs of some subgraph Q with associated nesting 
functions (j)K and (pK , respectively. Then we assume that 

(Vfc)(0K(fc) > 'fiKik)) ^ (5^ C 



K) 



We observe that a nesting function takes into account 
all the nodes satisfying the imposed conditions: our sub- 
graphs are maximal under the conditions imposed by the 
nesting function. Furthermore, from the properties of the 
nesting functions, we can conclude that the sequence 

{ipK{k)}T ^VK{l),VKm,...,m{i).- 
is a Cauchy sequence. 



^ oo 

J^^kpk<fK{k)z 



k-l 



Notice that, generally, /o(l),/i(l) < 1. For the sake 
of completeness, the section ends with the assymptotic 
expression that accounts for the degree distribution of 
the nested subgraphs, ps^, as found in Q: 



PSk (k) 



{Mi))Hi-fi{i)y-'p^ 



;;|f)E(y(Aa))^a-Aa)r-- 



Ml) kl dz'' 



^=i-/i(i) 



We observe that the second step is valid from the fact 
that (fc)}i° is a Cauchy sequence. 



III. BEHAVIOR AT CRITICALITY 

Once the operation of nesting is accomplished, the ob- 
tained subgraph can display many components of several 
sizes, including, in some cases, one component of infinite 
size containing a finite fraction of all nodes, the GCC. 
Let TTs be the probability for a randomly chosen node e 
to belong to a component with s nodes. We observe that 
the collection of tt^'s form a sequence of real numbers 
{tTs]^i whose associated generating functions are: 



ho{z) 
hi{z) 



J2^sz' 

S 



is) 



(1) 

(2) 



being 



the average size of components other than the GCC. If 
/io(l) is the probability that a randomly chosen node e is 
not in the CCC, then the probability for such a node to 
belong to the CCC, noted tTqc, will be tTqc, = 1 — ho{l). 
However, this formulation does not help us to understand 
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the problem. Following techniques close to the ones de- 
veloped to study branching processes, we can find an al- 
ternative form for ho and hi. Indeed, it can be shown 
that hi displays a Dyson-like recurrence relation [l6| . 

hi{z) = I - fi{l) + z^^k(1) + z^^Ki2)hiiz) + 



+ z^^^{3)hl{z) + ... 
= I- fi{l) + zfi{hi{z)) 



(3) 



and that the generating function for the size of the com- 
ponent to which a randomly chosen node belongs to is: 



hoiz) - 1 
With this formulation, 



fo{l) + zfo{hi{z)). 



/o(l) - /o("), 



(4) 



(5) 



where u is the first, non-trivial solution of the self- 
consistent equation m = 1 — /i(l) -I- fi{u) [111- F^i"" 
thermore, from the above definition of ho wc can obtain 
a useful expression of (s) : 



dz 

Ml) 



ho{z) 



z=l 

^ /o(2)U = l 



/l(l) 



dz 



1 



(6) 



As in modern theory of phase transitions, the main fea- 
ture of this phase transition is the existence of a singular- 
ity in some thermodynamic/statistical magnitude Q. In 
our case, the phase transition can be identified with the 
singularity we find in the component size distribution, 
(s) (eq. HI), at: 



dz 



= 1. 



(7) 



z=l 



Before the transition, ttoo = 0, being all components of 
finite size and, after the transition, tToo > 0, and the re- 
maining components display still finite size. Specifically, 
from eqs. (0 O [11, [I it can be shown that if: 

J2 k{k - 2)pk >Y,k{k^ 1)(1 - ipK{k))pu 

k k 

then there exists a single component of infinite size con- 
taining a finite fraction of nodes, i.e., the GCC. The 
phase transition referred also as the percolation thresh- 
old, is located at the point where: 

^ fc(fc - 2)pk ^Y.^{k^ 1)(1 - ^K{k))Pk- 



The critical region is located near the percolation thresh- 
old (if it exists), i.e., in the region near ([7]). To study the 
behavior of the cluster size distribution near the singu- 
larity, we look at the expression of h^^z), both depending 
on fo and hi. Nevertheless, we assume that j;fo{z) con- 
verges for any |z| < 1 (i.e. (fc) < oo and well defined, 
being {^K{k)pk}kLi ^-t least 1-smooth). Thus we must 
look for the singularity in hi. To study hi near the tran- 
sition, we define its functional inverse, h^^{T) = z: 



Mr) 



(8) 



Note that, due to the fact that all the members of the 
sequence {vK{k)pk}'kLi are not negative, we can be sure 
that the of zeros of fi fall outside the statistically relevant 
region -and, hence the poles of h^^. Thus we assume, 
without any loss of generality, that fi{z) ^ 0. Con- 
sistently, we expect to find the singularity at the point 
where 



= 0. 



Differentiating eq. ([5]), we see that: 



/i(r*)-(r*-l + /i(l)) -A(z) 



= 



As we argued above, we expect the phase transition of 
the system to occur at j^fi{z)\z=i = 1- Thus, if r* = 1, 
all the terms are cancelled. Furthermore, from (|8]) we 
can see that, if r* = 1, then, z* = 1. Collecting the 
above ingredients, and assuming that h^^ is analytical 
near the singularity of hi, we can perform the power 
series expansion of h^^ about 1: 

h^^z) = 1 + hi\z)U^iil-z)^ 



2fiil) dz^ 



M^) 



(l-z)2 + 0(l-z)= 



(recall that that -^h];'^\r=i = 0). We can assume without 
any loss of generality that: 



2/2(1) 



Mz) 



^0. 



Thus, knowing that h^ ^ {hi{z)) — z, we are legitimated 
to say that, near z = 1: 



1 - 



2/2(1) dz2 



Mz] 



il-hi{z))' 



This enables us to find the exponent /3, indicating the 
power-law behavior of hi(z) near the singularity. Specif- 
ically, 



hiiz) w 1 — c\/l — z 
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being c a constant depending on the values of both /i(l) 
and j^fi{z)\z=i- Thus, near the transition, hi{z) oc 
(1 - z)^, with P = 1/2, the standard mean field expo- 
nent. We observe that hQ{z) behaves identically near the 
singularity. Indeed, if we are close to z = 1: 

ho{z)(x {1^ zf +0{1~ z). 

However, we did not end the job, since we are also inter- 
ested in the cluster size probability distribution {tTs}^!. 
We attack the problem by expanding in power series the 
leading term of Hq^z) when z is close to 1: 

Notice that we have an approximation of hQ{z) in its 
original form given in eqs. (jll2p . Thus, 



s- 13-1 



r(-/3)r(s + i) 

1 /s-P-l' 




s-,3-1 



V2^(s-/3-l) 



r(-/3) 



where, in this case F, refers to the ordinary Gamma Func- 
tion and the last step is obtained assuming s — )■ cx) and, 



hence, applying Stirling's approach [20|. Since (3 = 1/2, 
in the limit of large s: 

TTsOis-'^; 7 = 1+/?=^. (9) 



IV. DISCUSSION 

In this short note I demonstrated that a wide variety 
of graph processes display the same behaviour at criti- 
cality. Specifically, given any iterative nesting operation, 
we expect the cluster size distribution to follow a power 
law with universal exponent 7 = 3/2 at the critical re- 
gion where the giant component emerges. As pointed out 
concerning the K-core in 17[ , the emerging components 
do respect the connectivity requirements imposed by the 
nesting algorithm, a feature that goes far from ordinary 
percolation, where only to be connected is required. Be- 
yond its intrinsic theoretical interest, the broad class of 
mechanisms that can be described through a nesting al- 
gorithm makes the universality of this result potentially 
powerful to understand natural phenomena at critical- 
ity where some kind of non-correlated pruning/addition 
process is at work. 
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